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Abstract. We link two phenomena concerning the asymptotical behavior of 
sto-chastic processes: (i) abrupt convergence or cut-off phenomenon, and (ii) 
the escape behavior usually associated to exit from metastability. The former 
is characterized by convergence at asymptotically deterministic times, while 
the convergence times for the latter are exponentially distributed. We compare 
and study both phenomena for discrete-time birth-and-death chains on Z with 
drift towards zero. In particular, this includes energy-driven evolutions with 
energy functions in the form of a single well. Under suitable drift hypotheses, 
we show that there is both an abrupt convergence towards zero and escape 
behavior in the other direction. Furthermore, as the evolutions are reversible, 
the law of the final escape trajectory coincides with the time reverse of the law 
of cut-off paths. Thus, for evolutions defined by one-dimensional energy wells 
with sufficiently steep walls, cut-off and escape behavior are related by time 
inversion. 

keywords: cut-off and metastability and hitting time and exit-times and 
reversibility 

1. Introduction 

The cut-off phenomenon, first identified and formalized in the early eighties [3J 
3 , is by now a well studied feature of Markov processes (see [23 [37J EH EE| f° r 
reviews). The phenomenon refers to an asymptotically drastic convergence of a 
family of stochastic processes labeled by some parameter a. As a — > oo, a 

suitable distance between the laws P [X^(t) € •) and the corresponding invariant 
measures (•) converges, in macroscopic time units, to a step function centered 
at deterministic times i™'. More precisely, the function a —> t^ ut is such that 
the distance is asymptotically maximal for times smaller than t™* — o(^ ut ) and 
asymptotically zero for times larger than i^ ut + o(t£ ut ) (Figure HJ). The term cut-off 
is naturally associated to such an "all/nothing" or "1/0" behavior, but it has the 
drawback of being used with other meanings in statistical mechanics and theoretical 
physics. Alternative names have been proposed, including threshold phenomenon [3] 
and abrupt convergence [55 . It has also been called abrupt switch in a precursor 
work [1] based on mixing times rather than distances between measures. 

Despite growing interest among probabilists, cut-off convergence has been largely 
ignored within mathematical statistical mechanics ( |38j is the only exception we 
know of, besides our previous announcement [9] and the thesis of one of us [8]). 
As a tentative justification of this situation, we stress the fact that cut-off studies 
have often been performed in a relatively elaborated setting that involves a certain 
amount of technical complexity. Indeed, cut-off was initially studied in evolutions 
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Figure 1. Distance to equilibrium during abrupt convergence 



denned by group actions (eg. card shuffling) [21 IS]- Emphasis was, therefore, put 
into Fourier techniques [26]. Subsequently [H[5l[23l[28l[29l|3T], the phenomenon was 
associated to the existence of "strong uniform (coupling) times" and to symmetry 
and multiplicity properties of second eigenvalues of transition matrices, see [2 7) for 
a review. It has also been associated to exponential speed of convergence in [53] 
, [1] and [33]. This fostered the use of notions and techniques issued from diverse 
mathematical fields, ranging from combinatorics to functional analysis, see [47l I48j . 
The mathematical richness of these studies may have prevented potential audiences 
from fully appreciating the probabilistic meaning of the phenomenon. 

In all these references, the existence of the phenomenon is proven through a 
precise estimation of the sequence i^j ut of cut-off times. This precision comes at a 
high technical price and is largely responsible for the variety of treatments found in 
the literature. It is clear that the theory would benefit from the disentanglement of 
the proof of the existence of a cut-off from the more delicate determination of the 
actual cut-off times. A contribution in this direction is the work done in relation 
with the conjecture formulated by Y. Peres in 2004. The conjecture states that in 
different natural settings a necessary and sufficient condition for a process to have 
cut-off is that the product of mixing time and the spectral-gap tends to infinity. 
It was proved that the conjecture is true for Markov chains for the L p norm with 
p > 1 in [22] . It was also proved to be true for birth and death chains in separation 
distance in [30] and in total variation distance in [32] . These articles have been a big 
contribution to the understanding of the phenomenon, yet the condition to observe 
the phenomenon still relies in the spectral structure of the chain and the behavior 
of the distance. In our opinion, this work should be complemented by an approach 
based on the analysis of typical trajectories, which would reveal, more clearly, the 
underlying probabilistic mechanism leading to cut-off among all possible patterns 
of convergence. The present work points in this direction. 

An alternative, more probabilistic approach proposed in the last decade |52[ 1531 
I54[ 140] offers some advantages regarding the above criticism. In this approach, cut- 
off is associated to the existence of an appropriate drift in stochastic processes or 
sampling schemes, and arguments are based on the behavior of hitting times [36] 
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I37[ 14]. Furthermore, the approach includes some general criteria for the existence 
of cut-off as illustrated by Proposition 13.11 below (taken from [40]). The resulting 
description offered some tantalizing contrast with another, quite different type of 
convergence associated to metastability. This observation triggered our research. 

From a physical point of view, a system exhibits metastability if it stays trapped 
for exceptionally long times in a state different from true equilibrium (eg. super- 
cooled water). The final transition to equilibrium is rather sudden and happens at 
"unpredictable" times. Mathematically, it is the behavior expected for evolutions 
designed to find a minima x + of some energy function H(x), when the system is 
"captured" by a local minimum at x~ (Figure [2]). 




x + 



Figure 2. The local minimum at x leads to metastability 

Unlike cut-off, metastable behavior has been thoroughly studied within rigorous 
statistical mechanics. The basic scenario was described in a seminal paper [21j that 
is contemporary of the first cut-off papers: A system driven downhill towards x~ 
will stay for a long time around this point until suddenly, at some random instant 
i csc , an escape trajectory unfolds, that takes it over the barrier at x° and down to 
the true minimum x + . The law of i esc converges to an exponential as the "barrier 
height" x° — x" grows in a suitable manner. In a typical trajectory, the distance 
to x + follows curves similar to those of Figure [TJ except that the jump-times t a are 
exponentially distributed rather than determinist. The parameter a diverges with 
the height and/or steepness of the small well. 

This picture, borrowed from the theory of randomly perturbed dynamical sys- 
tems [34] , has been called "path approach to metastability" . It has subsequently 
been refined and systematized [43] HU [51] ; see [45], [39J [46] for overviews. Escape 
from metastability happens through a very narrow choice of trajectories, known as 
the exit tube, which has been determined in a very precise manner for a number of 
spin dynamics [121 [50, 41, 42] [351 [55] • A more recent and efficient alternative rigor- 
ous treatment of metastability is based on potential theory p~5l fT7l IT6l [20l [71 [181 fl~9] . 
The approach relies on an optimization principle which allows the determination of 
metastable behavior through suitable trial functions, without an in-depth study of 
trajectories (see [T^l H31 [H] for reviews). 

Cut-off and escape behavior show, therefore, both similar and contrasting fea- 
tures. On the one hand, both are asymptotic phenomena: they show up when a 
suitable parameter is sufficiently large and their very definitions rely on limit prop- 
erties. On the other hand, while both convergences are sudden, the laws of the 
convergence times are antipodal: deterministic for cut-off and exponential (ran- 
domness without memory) for escapes. There are also differences in focus: cut-off 
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studies emphasize distance between measures, while exit times are the object of 
metastability studies. The message of the present paper, however, is that in some 
natural sense they are often intertwined phenomena. 

In order to exhibit this intertwining, we adopt a relatively simple, but sufficiently 
rich, framework where both the probabilistic approach to cut-off of [52, 53, 54l |40] 
and the path approach to metastability of [34] can be applied. We consider birth- 
and-death chains with a drift (towards the origin) and focus on hitting times for 
excursions along and against the drift. Under reasonable hypothesis on the drift, 
we show that cut-off behavior characterizes the first type of excursions, where the 
latter requires exponential escape times. If an energy function is associated to the 
drift, this results can be interpreted as saying that, if the well is sufficiently steep 
or high (and its walls are sufficiently smooth), climbing along a wall requires an 
exponential time while descents are abrupt. Furthermore, by reversibility, both 
types of excursions are related by time inversion: the law of escape trajectories 
equals the time reverse of the law of cut-off paths. In particular, our results apply 
to rather general recurrent random walks and to the Ehrenfest model. The latter is 
a kind of border line case, as it is associated to a well whose walls are not as steep 
as required in standard metastability studies. 

Our results explain, in part, why cut-off phenomena has been largely ignored 
by physics-minded researchers: it corresponds to the "trivial" downhill evolution 
of a system falling in an energy well. At the same time, our work suggests a more 
detailed decomposition of metastable evolutions. Coming back to Figure [2] the 
escape from the local minimum at x~ to the absolute one at x + can be decomposed 
in three stage: (i) the exit from x~ to the left- vicinity of a; , (ii) the transition to 
the right of x°, and (iii) the descent to x + . The exponential time typical of the 
transitions to equilibrium — from x~ to x + — is, in fact, determined by stage (i). 
The time scale of the other two stages is so much smaller that it dissapears under 
the usual time rescaling. Evolutions in more complicated energy landscapes can, in 
principle, be decomposed into a sequence of exponentially distributed uphill pieces 
followed by cut-off-like downhill excursions. Only the uphill stages survive the usual 
time rescaling, and the resulting theory should merge with existing metastability 
studies gag! EE]. 

Furthermore, the equality, modulo time-reversal, of the laws of up- and downhill 
trajectories clarifies the escape scenario: The final excursion taking the system 
from the bottom to the top of a well is the reversed of the cut-off trajectory that 
would take it in the opposite direction. Hence, this upwards motion is done in 
one stroke, almost without midway hesitations, in an asymptotically deterministic 
manner. The exponential law of the exit time is due to the fact that the system 
spends an exponential time around the bottom of the well till the atypical uphill 
excursion takes place. 

Our results suggest a profitable association between the cut-off and metastability 
research communities. We warn, however, that the intertwining between cut-off 
and exponential escape times is by no means universal. Exponential escape times 
are a more robust phenomenon that requires uphill time scales much larger than 
downhill ones. But the latter need not be cut-off like. Wells whose walls are rugged, 
or not strictly monotone, yield examples where the downhill time is not abrupt 
while the uphill law remains exponential. In this sense, knowing that downhill 
excursions exhibit cut-off conveys more detailed information than knowing that 
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the opposite evolution is escape-like. Nevertheless, it is reasonable to expect that 
methods devised to study one of the phenomena might be adaptable to the study 
of the other. This can be of particular interest, for instance to study evolutions 
involving bidimensional — or multidimensional — wells. 

2. Definitions and results 

2.1. The two types of behavior. Cut-off and escape behavior, will be studied 
at the level of hitting times. Both types of behavior are asymptotic, in the sense 
that they are characterized by what happens when a certain parameter a diverges. 
Let us start with the relevant definitions. 

Definition 2.1. (i) A family of random variables exhibits cut-off behav- 
ior at mean times if 

(I) U(a) Pmha : 1 

^ ' E [[/(«)] a^oo ' 

[equivalents, lim^oo P(U^ > cEp^}) = 1 for c < 1 and for c> lj. 
(ii) A family of random variables exhibits escape-time behavior at its mean 
times if 

The original definition of cut-off [TJ HJ [2] is in terms of the variational distances 
of invariant and process measures. We pass to variables because the comparison 
with escape behavior is possible only at the level of hitting times (the measures 
asymptotically associated to metastable escapes are invariably reduced to delta-like 
measures). The connection between both types of definition — in terms of measure- 
theoretical distances and of hitting times — has been explored in [3D] , where the 
following is concluded: 

(i) If the variable correspond to an absorption time, both definitions are 
equivalent, 

(ii) For a positive recurrent Markov chain X a (t) on N with invariant probability 
measure 7r and initial state a, the chain admits a cut-off at time t a if and 
only if its associated absorbed chain with invariant measure So admits a 
cut-off at the same time. 

2.2. The models. Each of the models considered in the sequel corresponds to 
a family {X^ : a S N} of irreducible discrete birth-and-death chains X^ a > = 
{X^it) : t e N U {0}} on intervals fb, aj := {b, b - 1, . . . , a} C Z. Without loss 
of generality, we choose a > and b < 0. The right endpoint a is the diverging 
parameter that will reveal the asymptotic behavior. We shall consider two cases: 
(i) half well: 6 = 0, which corresponds to a well in the form of a wedge with a 
vertical wall at the origin, and (ii) full well: b tends to minus infinity as a grows, 
at a proportional rate (most parameters involved in the definition of the processes 
will be a-dependent but, to unclog formulas, we shall only exhibit this dependency 
selectively). Chains with initial state z will be denoted x[ a \ 

Each family of chains is characterized by transition probabilities p x to the right 
and q x to the left: < p(x,x + 1) =: p x , b < x < a — 1, and < p(x,x — 1) =: 
q x , b + 1 < x < a. All other transition probabilities are zero. The "waiting 
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probabilities" r x := l — p x — q x , b < x < a may be non-zero, but they play little role 
in the sequel. The different p's and q's may depend on a. For each a the chain is 
positive recurrent and admits a unique invariant probability measure which is 
also reversible and can be explicitly written in terms of the transition probabilities: 



(3) 7T^(x) = 



nf =1 ^ (a) (o) *€M 
nr=x^^ o) (o) xe[b,-i] 



This expression follows readily from the reversibility condition ir^ (x) q x = ir^ (x — 1) p x -i. 
[Note that the values of r x appear only through the normalization constant ir^ (0), 
and hence they are largely irrelevant for the rest of our discussion.] 
We shall focus on the behavior of hitting times 

(4) T(% = min{t>0:X<°>(t)=y} . 

More specifically we shall consider T ( a %, T b % T^ a , T^ b and T^ {aM = min(T ( ^ a , T^ b ) . 
Notice that according to this definition T x ll x = for all x in [6, aj . 



2.3. The drift condition. We impose the following drift condition towards 0. For 
the sake of precision, we state it exactly as needed in proofs. The ensuing comments 
relates it with the picture in terms of energy wells. 

Definition 2.2. A family X^ of birth- and- death process has a strong drift to the 
left towards (we note + -SD) if 

(i) The transition probabilities q x satisfy 

(5) inf inf q x =: K„ > . 

oeNx€[l,o] 

(ii) The constant 

(6) K a := sup feEprJ^J 

x€[l,o] 

satisfies 

(7) 



Likewise, it has a Strong Drift to the right towards (equivalently 0~-SD) if 
(i) 

(8) inf inf p x =: K v > . 

o€Nxe[6,-l] 

(ii) 

(9) K b := sup p x E[T^2 x+1 ] 
satisfies 

K 2 

(10) ^ >0. 

The family X^ is said to satisfy the (bilateral) Strong Drift condition towards 
(0-SD) if it has both 0+-SD and 0~-SD . 
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Let us discuss the meaning of the 0-SD condition. All of the expressions given 
above in terms of hitting times have equivalent forms in terms of the invariant 
measures ir( a \ The connection is provided by the following well known expressions 

(ID E[TiU=^M i ^ X+1 ]=^M 

v ; L J q x ir( a )(x) 1 x ^ x+1 i p x ir( a )(x) 

which, for completeness, are proven in Section [?] below. As a consequence, 

(12) K a = sup , , ; K b = sup , . 

xe[0, a ] 7rW(a;) xe[t,-i] 7rW(x) 

These expressions show that K a and Kt, provide an exponential bound for the tail 
of 7r( a ) . Indeed, by the leftmost identity in (|12p. 



mi nWgx + l,a}) = , _ ^ (a) (g) < , _ J_ 

1 ^ 7rW([x s o]) ~~ TrWflfoa]) " *T„ " 

Hence, upon iteration, 

(14) tt^(x) <Tr^([x,a]) <e- a -" with a a = - log(l - 1/K a ) , 



for < x < a. The rightmost identity in (|12| yields the same inequalities for 
& < x < 0, with b in place of a. 

The 0-SD condition can be transcribed in terms of energy profiles by proposing a 
function H (x) such that p(x, y) oc exp{ — \H{x) — H(y)\\. Then, H(x) — H(x—1) = 
-(l/2)log(p s _i/ 9fl .) and, by ©, ttW (x)/^ (0) = exp{±[H(x) - H (0)]} with the 
upper sign for b < x < and the lower one for < x < a. Thus, from (|14p . 



(15) H(x)-H{0) > 



a a x for < x < a 
a.b x for b < x < . 



In the standard setting for energy-driven dynamics, the steepness a of the walls 
is assumed to grow as the parameter playing the role of a (often the inverse tem- 
perature) diverges. Here we do not require such a drastic behavior. In fact, a is 
allowed to go to zero (that is, K a — ► oo) as long as (|7|) and ifTO]) remain valid. This 
generality allows our results to be applicable to the Ehrenfest model. Condition 
([7| limits K a to grow sub-linearly with a, and (|10p implies an analogous limitation 
for as a function of 6. Indeed, (jTTJ) implies that E[X^ ] < K q K a a and, hence, 
the validity of ([7]) means that 

K K K 2 
(16) ^ < -^A 2 ► 0. 

a E[T^l ] a ^°° 

Note that, in particular, a family X( a ) has + -SD if K a is bounded uniformly in 
a. This is because E[T^ ] > a — > oo. Likewise, it has _ -SD if sup b if b < oo as 
long as 6 diverge with a. 

2.4. Main result and applications. To state the main result we need one further 
piece of notation. For x, y G [6, a], let t^Xx^ = sup{0 < t < TxXy : X x a \t) = x} 

be the instant of the last visit to x before hitting y and T^ly — Tx-ly — T x a \ x ,y the 
time needed to hit y after last visit to x. The following results apply also to the 
particular case 6 = — which we call half well. In this case — which constitutes, in 
fact, the building block of our arguments — every condition involving b is understood 
to be vacuous. 
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Theorem 2.3. Let X^ be a family of irreducible birth- and- death chains on fb, a]. 
Then: 

(1) By reversibility, T^ = T ^ a and = T^ b . (More generally, T^Ly = 

Ty a lx for any x, y e [6, aj.) 

(2) IfX^ has a SD to the left of (0+-SD), resp. to the right of (O'-SD), 
the random variables T^ , resp. T^ , exhibit cut-off behavior at mean 
times. 

(3) Assume X^ satisfies the 0-SD condition and consider the conditions: 

17 1 b r' )i >0 , 1 a , , >0. 



•6 J 



(a) If the leftmost, resp. the rightmost, condition is satisfied, then the ran- 
dom variables T^ a , resp. T^,, exhibit escape-time behavior at mean 
times. 

(b) If both conditions are satisfied, the random variables b , exhibit 
escape-time behavior at mean times. 

Note that, for processes with strong drift towards zero, the conditions (| 1 T[) are 
satisfied if there exist strictly positive constants c\ and C2 such that 

(18) ci < < c 2 , 

As commented in Section [SJ a weaker sufficient condition — also based on magni- 
tudes involving trajectories on one side of the well — is obtained by replacing the 
hitting times in (| 1 T[) by those for the corresponding "half-well" models (see Re- 
mark below). In particular, requirements (|17p are automatically satisfied for 
symmetric wells. 

Let us mention a few applications. 

Simple (non-symmetric) random walk. The simplest application is a random walk 
with transition rates that only depend on the sign of x and are independent of a: 
p x = p + and q x = q + for x > 0, while p~ and q x = q~ for x < 0. Formulas (jlip 
yield, in this case, K a = q + /(q + — p + ) and Kb = p~/(p~ — q~) (hence K q < 2). 
Thus, the resulting family X^ has a + -SD iff q + > p + . Likewise, it has a 0~- 
SD iff p~ > q~ and b — > oo with a. (That is, SD = positive recurrence.) For 
these walks, the difference between the mean times towards and away from zero 
is noticeable. Indeed, E[T„^ ] ~ a /(? + — P + ) while, for the half well (6 = 0), 
E[T,W o ] ~ (q+/p+r. 

Random walk with varying rates. More generally, Theorem 12.31 applies to random 
walks with 

(19) o + := min q r > max p. r =: n + , p~ := min p r > max q r =: q~ 

and b diverging with a. The values of K a and Kf, are as in the previous example 
and, as there, EfT^o] < const x a and E[T ( ^ a ] > const x (q+/p + ) a (for b = 
0). Furthermore, Theorem 12.31 holds even if the wall has a "flat" bottom. More 
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precisely, it holds if q x = p x = 1/2 for a; € \d a , d+] for some d a < and d+ > 0, 
as long as 

(20) ► , ► 

Q a — >oq o a> — ►oo 

and 
(21) 

q + := min q x > max p x = : p + , p := min p x > max q x =: q . 

Indeed, in this case we can take K a — d+ + [q + /(q + — p + )] and Kb — — d~ + 
[P~~/{P~ - <T)]- On the other hand, E[T^ ] > d+ + a[q+/{q+ -p + )], where 
q + = max{q x : x G a]]} and p + = min{p x : x £ [c?+, a]}. An analogous bound 
holds interchanging a b, q + ^ p~ and p + «-> Hence, conditions (|21|1 imply 
the validity of Q and ([TP]). 

For these examples, the observation of [3D] — commented below Definition 1 2. U — 
applies, namely the cut-off of the hitting time at zero is equivalent to the cut-off in 
the original measure-theoretical sense. 

Ehrenfest model. It was introduced by Paul and Tatiana Ehrenfest in 1907 to sup- 
port Botzmann's ideas by showing a simple example where microscopic reversibility 
leads to what can be interpreted as macroscopic irreversibility. The model consists 
in 27V particles distributed in two urns. At each unit of time one of the particles 
is chosen with uniform probability and changed of urn. The process follows the 
number of particles in one of the urns; a dynamic that is equivalent to the pro- 
jection of a random walk on the hypercube {—1, 1} 2N . In this last form, but in 
continuous time, the cut-off aspects of the model have been studied in [29]. The 
escape behavior has been analyzed in [6], in terms of the initial urn model, and 
in 1 1 2 j in its random-walk version for continuous time. 

Our framework applies to the initial discrete-time model. We call TV — * a and 
shift the urn counting by —a, so to obtain a model with drift towards zero. The 
resulting family has state space [—a, a] and transition probabilities 

a — x a + x 

(22) p x = — — and q x = — — . 

Za la 

Unlike our previous examples, both transition rates depends on a and have a drift- 
less limit {p x — q x — 1/2) when a — > oo. Furthermore, some elementary calculations 
(see [S]) show that K a = K- a = l/ir^ (0) ~ y/a. The model has a strong drift, 
however, because E[T^o] = E[xl^ ] ~ a log a. 



3. Sufficient conditions for cut-off and escape behavior 

In this section we develop the sufficient conditions to be used in the sequel to 
prove the behavior of the variables in Theorem l2.3l Here we adopt a rather general 
set-up, not restricted to birth-and-death processes. 



3.1. Cut-off behavior. The following sufficient condition for a family of random 
variables to exhibit cut-off behavior in the sense of Definition 12.11 was first 
pointed out in [40] : 
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Proposition 3.1. A family of random variables U^ a ' exhibit cut-off at mean times 
if 

(23) ' ' 



lim Var, 

a^oo VE[C/( a )l 



\E\UW]) 







The proof is a direct application of Tchebyshev inequality: For Z = 1— Z/w/E [C/( a ^] , 
we have that F(\Z\ > e) < E[Z 2 ]/e = Var(Z)/e. This simple observation allows 
the determination of cut-off behavior without a precise estimation of the cut-off 
times E[£/( a )]. 

3.2. Escape-time behavior. We adapt the "pathwise approach" introduced in [21] 
to a general setting. We consider Markov chains X^ a \t) on countable alphabets 
A a , and we distinguish a ("low") state £ £ A a , ("high") regions H a C A a and 
hitting times 



(24) 



■ r := inf inf{t > : X ( f a) (t) = h] 

-.(a) 



The mechanism through which the variables develop an asymptotically 

exponential law is the existence of a much shorter time scale for excursions in the 
direction Ti a — * £ than vice-versa. Let us be more precise. 

Definition 3.2. We say that the mean time from TL a to £ is much shorter than 
that from £ to Ti. a , and we write 

(25) E [ r S-*l < E [ T t ] 

if the following conditions are satisfied: 
(1) Different time scales: 

sup^E^ 



■>n a l > 



(26) 



lim 

a — >oo 



= o . 



E T, 



(2) Uniform integrability of the sequence 



(27) 



(28) 



T, 



(a) 



E 



T, 



(a) 



This means that for all e > there must exist L e > such that 



S (a) g(a) > K 



< e 



V a 



These conditions imply escape behavior: 

Theorem 3.3. Let (t) be a family of irreducible and positive recurrent Markov 
chains on a countable alphabet A a . If £ £ A a and TL a C A a are such that 
E[^h _ ►/] ^ E[T^ W 1, then the family exhibits escape-time behavior at 

mean times. 

Let us first summarize the central idea of the proof. It amounts to exhibit the 
asymptotic factorization 



(29) P 



T, 



(a) 



E[T t 



(a) 



> S + t 



T, 



(a) 



T, 



(a) 



> s 



(a) 



> t 







Ha 
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This follows from the existence, by condition (j26|) . of a time scale that is interme- 
diate between the hitting times in both directions. Indeed, 



(30) 

is such that 
(31) 

and 
(32) 



Aa = jm a J n J sup E[T^j 



Pa 



E 



T. 



(a) 



. 



Note that, by Markov inequality, this last equation implies that 



(33) 



' (walk visits in an interval of length A a ) 



T, 



(a) 



time units, 



If a process started at £ has not hit the region H a after s E 

then, due to (f33|) . it will almost surely fall down to £ within an interval of length A a . 
Once £ is visited, the process recovers the initial law, by Markovianness. Therefore, 

F (^>( S + 0E[^J) « 

(34) F(T^ na >sE[T^ a })p(T^ a >(t-A a )E[T^ na ]) . 

As A a is negligible with respect to E [T^ H ] [condition (|31[) ] , the factorization ([29]) 
follows. 

This argument is formalized through a number of technical results contained in 
the following lemmas. Let 

(35) Tp := inf{< > sEpZ^J : X { g a) (t) = i} , 
denote the first return time to £ after s E [T^ n ] , and 

(36) Si := ^ . 

These times are well defined: since, as we assume the chain to be positively recurent, 
every mean hitting time is finite. 



Lemma 3.4. Under the hypotheses of Theorem\ 
(37) lim P(5| > s + (3 a ) = 
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Proof. Define t a := sE T^ Ha , s a := \t a ], r a := [t a + A a \ and J := {s a , . . . ,r a }, 
where [x\ and \x\ denote respectively the floor and ceiling functions of x. Parti- 
tionning over all possible values of X^(s a ), we get 

P(S| > S + (3 a ) 

= p(xj a) (fc) ^l, Vfce I„) 
(38) = £ P (X (a) (fc) ^ * , Vfc g 4 | X, (a) ( Sa ) = z) P (x, (a) ( Sa ) = x) 



< 



sup p (x\ a \k) jtl,Vkei a x ( e a \ Sa ) = x) V p (X (a) ( Sa ) 

sup P > A Q ) , 



Hence, using Markov inequality, 

(39) P (5| > s + a ) < 

which tends to zero due to (l26l). 



sup xeAa E[T^] 



□ 



Lemma 3.5. Under the hypotheses of Theorem \3.3[ the following inequalities hold: 

(40) 

P (S (a) >s + t,Sf<s + /3 a ) > P (s (a) > s + f3 a , St < s + /3 a ) P (s (a) > i) 



and 



(41) P (V a) > s + t , 5f a < s + /3 a ) < P (.S (a) > s) P (S (a) > t - /3„) . 

Proof. We start with the following identity, obtained by partionning over all possi- 
ble values of Tf , 

P (s (Q) > s + 1 , Sf < s + /3 a ) 

= S P ( 5(Q) > * + * > 27 = ifc) 
fee/,, 

= p ( 5(a) > s + 1 ' 5(00 > s ' T z = k ) 

(42) = P ( S(a) > 3 + 1 S(a) >s,7% = k\ P (s (a) > s , T/ = fc) . 



fee /a 



The Markov property implies that 



(43) P (S (q) 



> s + t 



S {a) > s , Tf = k ) = P > s + 1 - 



(«) 



for all k G / a . The rightmost expression is bounded below by its value at k = s a , 

and bounded above by its value at k = s a + A a . Hence, 

(44) 

P (s (a) > tj < P (S (a) > s + t S (a) > s,Tf = k^j < P (S (a) > t - A,) . 
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(45) 



To prove (|40|1 we combine (|42|) with the leftmost bound in (|44|1 . We obtain: 

> s + t, SI < s + p a ) > P (s^ >t) P ( S{a) >s,Ti = k) 

kei a 

= ¥ (S (Q) > t) P (S" (a) > s , 31 < s + p a ) 
> P (S^ > t) P (s (a) > s + /3 a , 5| < s + p a ) 
Likewise, (|4Tj) follows from (|42| . the rightmost bound in (|44|) : 



(46) 



(s (o) > S + t , Sf < S + P a ) < P (s [a) > t - P a ) P ( 5(<l) > S,T e s = k 

kei a 

< P (s*( a ) > t - /3 a ) P (S (a) > s) . □ 



Lemma 3.6. Assume the hypothesis of Theorem \3.3\ and let S be the weak limit of 
a subsequence of S 1 -^ . Then, 



(47) 



'(S > s + t) = ¥(S > s) ¥(S > t) 



for all s and t such that s, t and s + t are continuity points of the distribution 
function of S. 

Proof. The previous lemmas remain valid for a varying through the subsequence 
index. For notational simplicity we keep using a for the latter and denote S^ the 
converging subsequence. Identity (|4T[) is proven by decomposing it into an upper 
and a lower bound. 



Proof of the upper bound. We write: 
(48) 
with 

.4 



(49) 
and 
(50) 

By 621, 



(S {a) > s + t) - P (s {a) >s)p (s ia) >t) = A a + B a 
¥ (S^ >s + t) -¥ (S^ > s) P (S {a) >t-p a ) 
B a = P (S {a) > s) P (S (a) >t-p a )-V (S (a) > t) 



A a < P (s (a) > s + 1) - P (S (a) > s + t, S% <s + p a ) 

(51) = P (S (Q) > s + t, Sf> s + p a ) . 
Thus, by Lemma |3.4[ 

(52) limsup^a < . 

a — >oo 

On the other hand, as t is a point of continuity and p a — ► [condition pip], the 
weak convergence of the sequence implies that 



(53) 



lim 

a— »oo 



(s^ > t - p a ) - ¥ (s (a) > t) 



= 
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Thus 

(54) lim B a = . 

From (|48p , (|52p. (|54p and the weak convergence of S( a ) we conclude that 

(55) P (S > s + 1) - P (S > s) P (5 > f) < 
Proof of the lower bound. This time we write 

(56) P (S (a) > s + tj - P (S* (a) > s) P (s (a) > t) = C a + D a 
with 

(57) C a = P (fifW > s + - P (S (a) > s + /3 a , Sf < s + f3 a j P (s^ > tj 
and 

(58) D a = P (s (a) > i) P (5 (Q) > s + p a , Sf < s + f3 a ) - P (S {a) > s) 
By m, 

(59) C a > P f S (a) > s + 1 , S| > s + p a ) > 

\ / a — ^oo 



where the convergence is due to Lemma 13.41 We further decompose D a by sub- 

stracting and adding P (S^ > s + fl a ) ■ We obtain 

(60) 

D a = -P (S (Q) >a + 0a,S;>s + I3 a j + [P (S^ >s + [3 a )-P (s (q) > s) 



Both terms converge to zero; the first one by Lemma 13.41 and the second by conti- 
nuity at s and condition pip . Therefore, 

(61) lim D a = . 

a — >oo 

Because of the weak convergence of , equations (fS"6"|) , JHH) and (|6ip imply that 

(62) P (S > s + 1) - P (S > s) P (S > t) > . □ 



Proof of Theorem 13.31 It follow from the preceding lemma and some general con- 
siderations. We first remark that the sequence is uniformly tight because 
E[S^] = 1. Indeed, using Markov inequality, 



(63) P (S* (a) > tj 



E \S^] 1 
> t) < — L - ] - = ► 

t t t^oc 



It follows (see, e.g., Theorem. 4.4.3. in [21]) that admits a sub-sequence S^" ' 
that converges in law. We only have to prove that there exists a unique limit law, 
namely £xp(l). 

Let S be the limit of one of these subsequences . By Lemma 13.61 
(64) P (S > s + 1) = P (S > s) P (S > t) , 

for all continuity points of the distribution function F$ of S. These discontinuity 
points are dense in R because F s , being increasing, has at most a denumerable 
number of discontinuities. Thus, (|64|) holds for all s and t. 

It remains to show that E [S] = 1 to prove that S can have only one law, namely 
exponential with mean 1. It is here that we invoke the hypothesis of uniform 
integrability of the sequence S^ a \ Indeed, by Theorem 5.4 of [ID] , the fact that 
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S( a ) is a weakly convergent sequence of uniformly integrable mean-one random 
variables, implies that 

(65) E [S] = lim E^ ")] = 1 . □ 

a — >oc 

4. Mean hitting times for birth-and-death processes 

The estimation needed to apply the criteria of the previous section to birth-and- 
death processes relies on explicit formulas of mean hitting times available for these 
processes. Throughout this section, to shorten formulas we adopt the conventions 
IT; : ■■= 1 and V;j ; := 0. 

We consider an irreducible birth-and-death process X on a finite interval [o, aj C 
Z with transition rates < p(x, x + 1) =: p x , b < x < a — 1;0< p(x, x — 1) =: q x , 
b + 1 < x < a, and r x :— 1 — p x — q x . The invariant measure for such a process is 
also reversible and can be written in the forms [c.f. |3j)]: 

(66) jr<°>(a:) = f[ *±„M(b)= f[ -^-v™ (a) 

t , -t Qi i i Pi-l 

1=6+1 i= x +l 

for b < x < a. Hence, for k < I, 

Pi-i t~t q l »rW(0 



(^) n T = n f: 

i=k+l H% i=k+l 



I (a) 

n V: 1 7r(a) ( fc ) ' 



Proposition 4.1. For the preceding birth-and-death process, the following identities 
hold: 

(1) For b < n < j < a, 
(68) E[T^ n ] = 



fc=n+1 % 7T(fc) 

(69) E[T^„] = £ ^^^ E [T^„]7r(0 - E[T^ n ] . 
(2) For b < j < n < a, 

Pkir(k) 
n — l fe 

(71) E [TjL»] = 2 £ E[7U " ] ^ W - E[T ^" ] 

n 1 

(72) E[T,^„]+E[T^,] = Y, ^ik V 



(3) For b<m<j<n<a, 
(73) E [T^ {mM ] 



E [T„] E [Tj^ n ] - E [T„] E [T„^-] 



E [T TO _„] + E [T„^ m ] 

F. [T___ ._ 

(74) 



E [T m ^„] E [Tj_> m ] - E [T n ^ TO ] E [T ro ^-] 



E [T„] + E [T n . 
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Proof. We apply the standard technique of establishing a difference equation for 
consecutive mean times that is solved by iteration (see for instance |33j chapter 
XIV). In all cases, the equation is obtained from the following decomposition. 

Claim: Let /C C [a. £>J and j ^ /C, b < j < a. Then, for any function F, 
(75) 

E[F(Tj^ K )] = pjElFiTj+^ + l^+qjElFiT^^ + l^+rjElFiT^ + l)] . 
with the conventions T a+ i_^/c, T^-i^k, : = an d , Ti^K '■— if i G fC. 
Indeed, decomposing according to the first step of the process, 



E[F(T^ K )] 
(76) 



Pj E F(T^ K ) X(l) =j + l + q 3 E F(T^ K ) X(l) = j - 1 



■ rj E 



F{Tj-K + 1) 



In the three cases i = j — 1, j, j + 1, 



X(l) 



k>l 

= ^F(fc)P(T i ^ = fc-l) 
fe>i 

= ^F(fe + l)P(T i ^ X ; = fe) 



X(l) 



(77) 



fe>0 

= E 



[F(T^ K + 1) 



The second equality is due to Markovianness. The claim follows from (|76j) and ([77| . 

froo/ o/ (E2|j and Applying (JT5J) for F(T) = T and /C = {n} we arrive to the 

difference equation 



(78) E|T, 
valid for j 

(79) 



6+1, 



L J+1 ^" J Pj+ffj L J * "" J ' Pj+q 3 
, a — 1 and subject to the boundary conditions: 

E[T b ^ n ] = E[T b+1 ^ n ]+- 

Pb 



E[T n ^ n ] = 

E[T a _„] = E[T Q _ 1 ^„]+ — 

q a 



To prove (|68[) it is useful to rewrite ([78)1 in the form 
(80) D k - D fe _i = B k {D k+1 - D*) + e fe 



with Dj 



Pj/qj and tj = l/qj- This equation is easily solved by 



iteration, yielding 

(81) D k - Dfe_i = J[ B t (D a - D a -!) +^2eiJ[Bi, 
and, thus, 

(82) Dj = D a 



a-l l-l 



l=k 



a a — 1 a— 1 I — 1 

^ (n Si (D a -£>„_!) +5>n^ 
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Upon application of the last two conditions in (|79|) : 

D a - D a -i = e a 

(83) D n = 0, 
we finally obtain 

(84) d 3 = e i>ns t , 

k— n+1 /— fc i— k 

that is, 



(85) E[T^„] = E II 



Pi-i 



k=n+l ^ k l = k i=k+l " % 



This is, in fact, expression due to identity (|6"7[) . 

The proof of ([70)) is analogous, but inverting the direction of iteration. We write 
(1781) in the form 



(86) £> fe+1 - £ fc - B k (D fc - Dfc-!) - ? fc 
with = qj/pj and ej = 1/pj. The solution is now 

j-l k k k 

(87) Dj = D b + E( II 5i (^H-i - A) - E ?i II S ' 

i=6+l 2=6+1 i=J+l 

The first two conditions in ([79)) : 

Db+i — Db = —eb 

(88) D n = 0. 
lead to 

(89) ^=EE^n^- 

fc=j l=b i=l+l 

In terms of the original quantities this is equal to 

oo) e [Tj^n] = E^rE n 



A;— j /— i— / + ! 



which, together with f67|). leads to fTOj). 

Proof o/ JZGJ) arwi (Tip. We apply for F(T) = T 2 and JC = {n} to arrive to the 
difference equation 

(91) E[T? J = E[T2 , 1 + — L E r T 2 1 + 2E i T ^] - 1 

L ° J ft' + L J+ J ft + QJ 1 J J ft + 13 

j G [6 + 1, o — 1], plus the boundary conditions: 

Pb 

(92) E[T„%J = 

2E[T a ^ n ] - 1 



e[t 2 __] = E[r a 2 _^„] 
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If we now substitute Dj = E[Jf_^ n ], Bj = Pj/qj, Bj = qj/pj, e 3 = (2E [ij-m] - 
l)/<7j and Zj = (2E[7j_ m l — we recover equations (1501) and (156) and the 

boundary conditions ([55)1 and (J55J. The corresponding solutions (JSJ) and 
yield 



(93) E[TjL„] - 



E ^E( 2E ^-]-l) n V &<n<i<<* 

— n+l t=fe z— fe+1 

n+1 1 jfe jfc 



, . Pk . , ^ 1 . . , , Pi-j 



The first and second lines become, respectively, (|6U)l and (fTTj) upon resorting to 

USB and dHSD/dnOD- 

Proof of $7^). From (|68|) , ([70]) and the reversibility relation Tr(x)p x — tt(x+1) q x +i, 
we have: 



(94) E[T^„] +E[T n ^] - ]T 



TTdM-lD+TTdM 



oi. 7r(fc) 

fe=j+i HK v ; 

which is, precisely, (|72|) because 7r([6, aj) = 1. 

Proof o/ (7|) and f7$. The application of ([75|) for F(T) — T and K. = {m,n} 
leads to the difference equation (|78[) with n replaced by {m,n}. The equation is 
valid for m < j < n and is supplemented by the boundary conditions 

(95) E[Tn-m] = = E[T m ^ m ] . 

The substitutions = E[Tj^{ m n j], Bj — pj/qj and ej = 1/qj lead to the same 
identity (|80|) which, iterated from k — n downwards, yields 

n to— 1 to— 1 i— 1 

(96) Dj = D n - j2 ( n Bi - + e e; n s ■ 

fe=j-|-l i— l—ki—k 

Once imposed the condition £) n = 0, the condition, the solution takes the form 

(97) Dj = D n ^Fj-Gj , 
with 



n n— 1 



(98) Fj = E IT 5 ' 

fc— j + 1 i=fc 

n n — 1 2—1 

(99) = £ 5>TT^ 

fc^j + l l—k i—k 

The remaining condition D m = implies, then, 

G r , 

Replacing Bi = Pi/qi, €j = 1/qj and using ([67)) we get 
(101) Fj = V ^ 



(100) Dj = -^Fj- Gj 
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and 

7r([fc,n-l]) 

k=j+l y V ' 

/ 102 s = 7r([fc,a|) -7r([n,g]) 

We now resort to ([55]) and ([72")) to conclude: 

(103) Fj = q n 7r(n) [E [T^„] + E [T n ^]] , 

(104) Gj = E [T n ^] - 7r([n, a]) [E [T^„] + E [T„^-]] . 

Identity (|73[) is obtained by replacing these two equalities (and their j = m version) 
into poop . Identity ([74")) can, of course, be obtained in a similar fashion iterating 
(155)) up from k = m. Alternatively, it follows from the fact that the numerators of 
(175)) and ([74")) are equal because their difference is 

E [T n ^ m ] (E [T m ^ 3 ] + E [T^ n ]) - E [T m ^ n ] (E [T„^-] + E [T^ m ]) 

(105) = E [T„] E [T m ^„] - E [T m ^„] E [T n ^ m ] = . □ 



Next we list two useful consequences of the previous proposition. 
Corollary 4.2. Consider the process of Proposition [7^7) Then, for b + 1 < x < a, 

(106) E[Tj x ] = 2 ^tt(M^ _ E[r j 
and, /or b < j < n < a, 

(107) E[T^J < 2E[T^„] 2 + 2E[T b ^]E[T^„] -E[T^„] . 

Proof. Display (|106|) is an immediate consequence of ((M)) and (JBSJ. Inequality 
(fTU7|) is obtained from (JTTJ> and ([70)): 

E[3jLn] < 



< 

(108) 



E — 77tE( E [ T ^+EM^) - E K-«] 

E E E ^ <0 + 2 E K-n] 2 - E [ T i-n] 

fe=j l=b 

2E[T fe ^] V ^# + 2E[T,_] 2 - E[T^„] 
2 E [T b ^l E [T^J + 2 E [T^J 2 - E [T^J . □ 
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5. Cut-off and escape for half-well models 



We shall proof parts 2 and 3 of Theorem 12.31 in two stages. In this section we 
consider the half-well models, obtained when setting b = and go — or a — and 
Po = 0. The actual result is a little stronger than the statement of Theorem 12. 31 It 
is based on the quantities 

(109) Q (a) (*):= * „ E ^WM 3 ; Qia): = SUP Q(a)(x) - 
As 

(110) 7r(°>([l,o])QW(l) <g (Q) (l) < Q {a] < K 2 a K q , 
the following lemma implies parts 2 and 3 of Theorem 12.31 

Lemma 5.1. Let be a family of irreducible birth- and- death chains on [0, a], 
(i) If 

QW(1) 
E[tL„ 



(HI) t (o) ([!,«]) >0 



and, as a consequence, the random variables 
Tg^ a have escape-time behavior at mean times. 
(ii) If 

(112) — ^ .0, 

£/ien Var(T^ /E[T^ ] ) — ► as a — > oo and, as a consequence, the 
random variables T^lo have cut-off behavior at mean times. 

Proof, (i) We have to verify that both conditions in Definition 13.21 are valid for 
t = 0, 7Y a = {a}; escape behavior follows then from Theorem 13.31 The second 
condition is a consequence of (|10T|) which for the half well implies that 

(113) E[(T ( ^ a ) 2 ] < 2E[T W a ] 2 . 

Therefore the sequence = 7of!; /E [T *4 o ] is uniformly square integrable and 
thus, by Schwartz inequality, uniformly integrable. 

The first condition amounts to proving that E [T„^ ] /E [To^J — > as a — > oo 
or, equivalently, that 

(114) , ffigj , , .0. 



By (1681) and (|72|) this corresponds to 

y^TT^M) 

^ <h> 7r (a) (a:) 

(H5) rW := ^ 

V - 

^ 9s ^ (a) (a) 
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To prove this we consider the probability measure on [0, a] defined by expectations 

^ q x 7r( a ) (x) 
(116) £{F) := 5=i . 

V - 

Indeed, the inequality £ (F) 2 < E(F 2 ) for F(x) = -K {a) {fx, a}) implies that 1 

y ^ (Q) ([z,"l) 2 ^n^HMf 

(U7) r(B , < S fc^W Sfc^C*) 



y^ TrWgar.ol) E[t£> ] 
^ g*7r(«)(a:) 



Thus, 



(118) F (a) < 7rW([l,aP Q(o) (l) 



and (|115p is a consequence of hypothesis (lllip . 

(ii) Due to the independence of the hitting times and to (|106|) . 

Var(TW ) = £var(r£Li) 



2 



^i qx n ^ i^L qc n ^ 

Hence, by (68]), 

/ T (a) „ \ 0(°) 
(120) Var( V? ) < r >0 - 

The cut-off behavior is, thus, a consequence of Proposition 13. 11 □ 

6. CUT-OFF AND ESCAPE BEHAVIOR FOR THE FULL WELL 

6.1. Comparison (in)equalities. Part of the results will be obtained by com- 
parison with the half-well hitting times. Let us therefore define, for each process 
X<~ a \ its left and right X^ half-well versions. The latter is a birth-and-death 
process on [0, a] with transition rates {px,qx,r x ) = (px^Qxifx) f°r 1 < x < a and 
(Po,qo,ro) — (po,0,qa + r ). Likewise, X^ is a process on [6, 0J with transition 
rates (p x ,q x ,f x ) = (p x ,q x ,r x ) for b < x < -1 and (po,qo,f ) = (0, q 0l p + r ). 
From ^ we see that the left or right invariant measure differ from the bilateral 
one only in a normalization constant: 

{n^ (x) 
, ; x e lb, oi 
tt(-) (y) 
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The behavior for the full well is proven on the basis of the following comparison 
inequalities. 

Proposition 6.1. Let be a birth- and- death process and X^ a \ its left and 
right half-well versions. Then, 



(122) E[TW J ]=E[Ti!iJ 

(123) E[TWj =E[fW ] 
and 

(124) E[T<1] 



, T (o) - / / 

Varf | = Varf a "~ >0 ] 



(a) 



Varf r& 7" ) = Varf 



(a) 



TrW^-lDEff^J+E^, 



Ma) 



7r(«)([0,a]) 

7r(°)([l,a])E[r^ ]+E[r^ 6 
7r(«)([6,Ol) 



> 



= E[7gj v E[7gJ < 

a ' e[tW] e[tW] - ' 



(125) E[TW] 
Furthermore, if 

(126) 

i/ien 

(127) (l ~ 2 2Ca) E[TW a ] AE[rW] < E[TW {8i6} ] < E[T&] AK[rWj . 

Proof. The identities in ||122|) and (|123p are an immediate consequence of (|551) - (fTTj) 
and (fT2"Tj) . 

Proof of {7gp and From ([7DJ), ft2) and pgjl . 



E[T WJ = J] 



*■([&,*]) 



Pfc 7r(fc) 

= _l])+*([ ,fc])]-i 



fc=0 
a-1 



p fc 7r(fe) 



*<fo-u) 



(128) = tt([6, -1]) (E[T o ( !l ] +E[T W o ]) +E[f ^ a ] . 

Passing tt([6, -1]) E[T ( ^ Q ] to the left-hand side we arrive to (fTM|) . The proof of 
(I125P is analogous. 

Proof of [12ty . The upper bound is an immediate consequence of the fact that 
T^ {a b} <To^ a , T^ b . For the lower bound we observe that from (J73J), 

(129) ' 



E 



T, 



(a) 



0— >{a,b} 



E 


T (a) 




± a^b 




E 


71(a) 


+ E 


T (a) 


^b^a 


± a^b 



-E 



T, 



(a) 



E 


71(a) 




J-b^a 




E 


71(a) 


+ E 


71(a) 
^a^b 


± b^a 







ABRUPT CONVERGENCE AND ESCAPE BEHAVIOR FOR BIRTH AND DEATH CHAINS23 



Hence, if A a := E 



T 



(«) 



/[E 



T, 



(a) 



E 



T 



(«) 



E[T ( !> {0 , 6} ] > E[T^ a ][X a -C a ] 



(a) 



(130) > E r ( : 

Analogously, using (f?4")) we obtain 

(131) 4 T cS { «,6}] > E[t£> ] A E[T W ] [(1 - A„) - C„] 
Adding ([HQ)) to (filiT]) we obtain the lower bound in (THT)) . 



□ 



6.2. Proof of cut-off and escape behavior for the full well. In this section 
the notation "o(l)" stands for a positive term that goes to zero as a goes to infinity. 



Cut-off behavior ofT^l and T^ g . The laws of these hitting times depend only on 
half- well trajectories, therefore their cut-off behavior has been proven in Section [5] 
More formally, let us prove the cut-off behavior of T^ , the proof for T^ is 
analoguous. 

We suppose that has a strong drift to the left (0 + -SD). We thus have 

K 2 JE[T^ ] >o. 

a — >oo 

As a consequence of (I121[) the quantities K a and do not depend on the 
left part of the well (that is: K a = K a and = Q^). Thus, by (Tm)) and 
since < K\ K q (see (|110l) ). we can apply part (ii) of Lemma ISTTl to the half- 

well version X^ a \ to conclude that Var(ri^ /E[ri^ ] ) — > as a -> oo. Cut-off 
follows from Proposition 13. II 



Escape behavior of Tg°^ a and T^±. We consider T^ a , the proof for is anal- 



ogous. We apply Theorem 13.31 by proving the validity of conditions 1 and 2 of 
Definition 13.21 for £ = and TL a = {a}. Condition 1 requires that 

(132) 1 a 01 V L fc ^ oJ • n 



E[T^ a ] E[T ( (a) 



0— >al 



But the left identity in (| 1 22(1 and the inequality in (|124D imply that 

( a ) i wir-H i wi>f( a ) 



E[lgo] _ E[T^ ] E[T C 



(133) = -Httt < 



a^O I 



E[t£> ] E[T ^ a ] E[f t (a) 



which tends to zero as a diverges because of part (i) of Lemma lOl (using again Q12ip 
and piOp as in the previous proof). On the other hand, the leftmost hypothesis in 
(TIT)) guarantees that ERr^J /E[r o ^ a ] tends to as a goes to infinity. This proves 
(|132p . Condition 2 follows from (|107p which implies that 

E[(T,S Q ) 2 ] < E[2]&] 



(134) = 2 + o(l). 
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The second line is, again, due to the leftmost hypothesis in (fT?]) . This proves that 
the sequence T^ a /E [T ^ a l is uniformly square integrable and therefore uniformly 
integrable. 

Escape behavior of b ^ . We verify conditions 1 and 2 of Definition 13.21 for 

I = and TL a = {a, b}. For the former we must consider the ratio 

(135) flW •= E [ r ^o] VlE [ r ^°] 

By the lower bound in (|127[) , 

136 R [a) < 1 H LJ H 2i ■ 

- i-2C aE [r (») o ] AE [ To W] 

As seen in CEM}: E[T a ( !> ] /E[T ( ^ a ] < E[f^o] /E[T ( 3 o ] which tends to zero 
by part (i) of Lemma 15.11 Analoguously, using the left identity in (|123p and the 
inequality in (|125[) . we get that E[T fc ( ^ ] /E[T^ b ] tends to zero, and as a conse- 
quence C a also tends to zero as a — > oo. 

Condition 1 is proven, since by hypothesis (H7J) E[T Q ( !i ] / E[T ( ^ b ] and E[T b ( ^ Q ] / E[T^ a 
tend to zero. 

To prove condition 2, we use the fact that (T^ {aM ) 2 < {T^ a ) 2 , (T^ b ) 2 and 



the lower bound in (|127p to obtain 

E [(^ {a , 6} ) 2 ] ^ / 2 \ 2 E[(^ tt f]AE[(lil) 



E[T tV b} ] 2 " V1-2CJ E[r<l] 2 AE[T<!> 6 ] 2 

2 \ 2 E[(T W o ) 2 ] w E[(T W) 2 



(137) < 



1-2CJ E[r W Q ] 2 E [t o W] 2 



where the last line is due to (|134p and its analogous for a <-> 6. This proves the 
uniform square integrability — and, in consequence, the uniform integrability — of 
the sequence T^ {ab} /E [T^ b} ] . □ 

Remark 6.2. TTie same argument used in H133\) show that the conditions 



(138) .0 , ~ a ,"!° J >0. 

are sufficient for the validity of |17[ ). TTizs requirement is weaker than j!8\) . 



7. Proof of part 1 of Theorem 12.31 Equality of the laws of direct 

and reverse trajectories: 

This property is a direct consequence of the reversibility of the invariant measure, 
and is unrelated to the particular behavior of hitting times. While we do not doubt 
that the result is well known, we have not found it spelled out in the literature. For 
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completeness we are presenting a detailed proof. The property stated in Theorem 
is a particular instance of the following proposition. 



Proposition 7.1. Let us consider a recurrent Markov process on a finite alphabet 
A defined by transition probabilities p(u, v) with invariant reversible measure n. 
Then, for any x, y € A with ir(x), 7r(y) ^ any natural k, 

(139) P (f x ^ y =kj = ¥ (f y ^ x = fc) 

where T x ^ y is the time needed to hit y after hitting x (defined in Section \2.J\ ). 

Proof. It is decomposed in two claims, the first of which is adapted from |50j . Let 
v k = (vi, . . . ,Vf.) denote a trajectory of length k, let V£ denote the set of all 
trajectories of length k "strictly between x and y" and let U x ^ y be the set of all 
finite such trajectories: 
(140) 

V^ y := {v k :Vl =x;v k = y; Vi ^x,y,l<i<kj , U x ^ y := |J V^ y . 

k<oo 

Let X z denote the Markov process started at z G A Following [50], we say that 
X z starts as v k if for all i = 1, . . . , k, X z (i — 1) = Vi, and for a set Vc^l, that 
X z starts in V if X z starts as v k for some v k £ V. Let 7Z denote the time-reversal 
operator on the space of finite trajectories: lZ(v k ) = (v k , ■ ■ ■ ,Vi), 

Claim 1: 

P (x x starts in V k ^ y X x starts in U x ^ y ^j 

(141) = P (x y starts in TZ(V k _ y ) X y starts in TZ{U x ^ y fj . 
Indeed, by reversibility, 

7t(x)V(X x starts as/) = ir(x)p (x, v 2 ) p(v 2 , v 3 ) ■ ■ ■ p(v k . 2 ,v k -i) p(v k -i,y) 
= p(v 2 ,x) p(v 3 ,v 2 ) ■ ■•p(^.i,u fe . 2 ) p(y,v k -i) 7r(y) 

(142) =TT(y)F(X y starts as TZ(v k )) . 

Summing over v k € V x -> y , we get ir(x) P (X x starts in V k ^ y ) = ir(y) P (X y starts as TZ(V k ^ y )) 
and thus, summing over k, 

(143) n(x) P (X x starts in U x ^ y ) = n(y) P (X y starts in K(U x ^ y )) . 
Dividing each side of (I142|) by the corresponding side of (I143|) we arrive to (jl41[) . 

Claim 2: 

(144) P (T x _ y = k^j = P (X x starts in V k ^ y \ X x starts in U x ^ y ) , 



(145) P [T y ^ x = kj = P (X v starts in TZ{V k ^ y ) \ X y starts in K(U X ^ V )) . 

(x) 

Let t x := Tx->x,y be the time of last visit to x before hitting y, and for n G N let us 
denote 9 n : N — > N the "forward time-translation" 9 n (t) = t + n. Partitioning over 
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all possible values of t x and using the Markov property we obtain 
P (f x ^ y =k)=J2 F (ja = n; X x o 6 n starts in V*_^) 

n 

= ^ P(jf a (t) 7^ 0, t < n; X x (n) = a; A x o 6>„ starts in V*^, 

71 

(146) = F(X X starts in V£_^) ^ v(x x (t) ^ 0, < < n; X x {n) = a 



Summing over k, we get 

(147) 1 = J2 p (*«(*) ^ 0. i < »! = a ) 2 P ( X - starts in V ^v) ; 

n fc 

therefore 

(148) J2p(x x (t) ^0,t<n; X x (n) = a) = — 

~ v ' PfX s starts in V|_^ 

The combination of (|146|) and (|148[) yields (|144|) . The proof of (|145[) is analogous. 

□ 
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